# Low-latency audio processing
Voila Chat
MIT
Voila is a brand-new large-scale speech-language foundation model series designed to elevate human-computer interaction to unprecedented levels.
Text-to-Audio
Transformers Supports Multiple Languages

V
maitrix-org
2,423
32
Sanji
This is a real-time voice conversion (RVC) model named 'Sanji', designed for audio-to-audio conversion tasks.
Speech Synthesis
Transformers

S
sail-rvc
208
0
Ai Light Dance Stepmania Ft Wav2vec2 Large Xlsr 53 V1
Apache-2.0
This model is an automatic speech recognition model fine-tuned from wav2vec2-large-xlsr-53 on the GARY109/AI_LIGHT_DANCE - ONSET-STEPMANIA2 dataset.
Speech Recognition
Transformers

A
gary109
48
0
Featured Recommended AI Models